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Web Content Adaptation Proce ss and System 

Technical Field 

The present invention relates to an apparatus and method for adapting web page 
5 content for display on an intended display device, by splitting the content into a plurality of 
smaller web pages. 

Back ground to the In vention and Prior Art 

To deliver web content to different devices is a process of understanding, re- 
10 structuring and tailoring the content in such a way that the content source can be 
understood and delivered to different devices (such as desktop PCs, PDAs, and mobiles 
phones) in a manner which suits the device characteristics. Within the prior art (see for 
example Current Technologies for Device Independent, Mark H Butler, HP Labs Technical 
Report HPL-2001-83 4 April 2001), there are three presently known ways of doing it: 
15 Firstly, web developers/authors can use web page development software to tailor 

the content manually to suit different devices, at the web content development stage. By 
doing this, different versions (e.g. HTML/CSS, WML, XML/XSL) of a single source can be 
created based on the device capabilities. This approach is the primitive way to deliver web 
content to different devices, and is a time-consuming and tedious task for web 
20 developers/authors if a large number of versions are required. 

A second more automated approach is to use a proxy-based trans-coding approach 
through a proxy server which does the adaptation work on the fly when an end user submits 
an URI link through a HTTP request. This approach is computing intensive at the proxy 
server, and has the result that the system response time is slowed. Furthermore, there is 
25 no intervention of the original web developers/authors to the adapted web content, which 
may raise legal and copyright issues in some countries. 

A third known technique is to use a client-based (end user device) adaptation 
approach by installing the adaptation system software at the client side. The client-based 
adaptation system will adapt the web content on the fly after it receives the result sent back 
30 by the requested web server. This approach is computing intensive at the client side, which 
will consume and degrade the client processing performance. Again, there is no 
intervention of the original web developers/authors to the adapted web content, which might 
raise legal and copyright issues as well. Furthermore, this approach is not possible to be 
applied in small mobile devices due to computation power limitations. 
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As part of the adaptation, it may be necessary to split a page of web content into a 
plurality of smaller pages (known as content splitting), in view of the small size of client 
display devices such as PDAs and WAP phones. For example, to render the contents of a 
PC web-page (e.g. 800x600 pixels) to a smaller display (for example a PDA with 240x320 
5 pixels) in a readable form, it is desirable to adapt the content to fit into the fewest number of 
smaller pages whilst also trying to minimize the amount of white space. 

Prior known methods for content splitting include those described in US Patent 
Application No. 09/942.051 (published as US 2003/0050931 AD. entitled "System, 
Method and Computer Program Product for Page Rendering Utilizing Transcoding" 

10 This document describes a system which is able to adapt web content for display on 
different viewing devices by splitting the content into multiple pages. The system firstly uses 
the web content to construct a hierarchical tree structure using XML, which is then 
formatted for display on the device (eg by changing to a text font which is supported by the 
viewing device, and replacing redundant information with references to variables). The 

15 formatted structure is then split into multiple pages for output to the viewing device. 
However, whilst such an approach achieves the objective of splitting the content to fit the 
client display device, it does not satisfy the desirable objective of minimizing the amount of 
white space. 

In view of the above, there is a need for a further approach which adapts web page 
20 content for display on an intended display device which does not possess the 
disadvantages of the prior art, and in particular with regards to attempting to minimize the 
amount of white space which will be displayed with the content. 

Summary of the Invention 

25 In order to meet the above, the present invention provides an apparatus and 

method for adapting web page content for display on an intended device. Here, an 
integrated process of splitting combined with transformations are provided in an iterative 
manner to adapt the content display size. Thus allowing the content to be split into a 
suitable number of smaller web pages whilst keeping to a minimum the amount of white 

30 space that will be shown on the pages. In this context it is understood that the term 
"transformation" can include, for example, reducing / increasing the size of images / text, 
removal / replacement of content, etc. 

According to a first aspect of the present invention, there is provided an apparatus 
for adapting web page content for display on an intended display device, comprising 
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adaptation means for splitting the content into a plurality of smaller web pages for display on 
said device, the adapation means being arranged in use to: 

(i) split the content into a plurality of content portions, and to iteratively repeat steps 
(ii) to (vi) for at least one of the content portions: 
5 (ii) analyse the content to determine whether the size of the content portion is 

suitable for display on said device; 

(iii) if the size of the content portion is not suitable for display on said device, then 
apply a plurality of transformations to the content portion; 

(iv) analyse the transformed content to determine whether the size of the 
10 transformed content portion is suitable for display on said device; and 

(vi) if the size of the transformed content portion is not suitable for display on said 
device then split the content portion into a plurality of further content portions. 

According to a second aspect of the present invention, there is provided a method 
15 for adapting web page content for display on an intended display device, comprising 
splitting the content into a plurality of smaller web pages for display on said device by 
performing the following steps: 

(i) splitting the content into a plurality of content portions, and iteratively repeating 
steps (ii) to (vi) for at least one of the content portions: 
20 (ii) analysing the content to determine whether the size of the content portion is 

suitable for display on said device; 

(iii) if the size of the content portion is not suitable for display on said device, then 
applying at least one content transformation to the content portion; 

(iv) analysing the transformed content to determine whether the size of the 
25 transformed content portion is suitable for display on said device; and 

(vi) if the size of the transformed content portion is not suitable for display on said 
device then splitting the content portion into a plurality of further content portions. 

According to a third aspect of the present invention, there is provided a computer 
30 program or suite of programs so arranged such that when executed by a computer system 
it/they cause/s the system to perform the method above. The computer program or 
programs may be embodied by a modulated carrier signal incorporating data corresponding 
to the computer program or at least one of the suite of programs, for example a signal being 
carried over a network such as the Internet 

35 
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Additionally, from a yet further aspect the invention also provides a computer 
readable storage medium storing a computer program or at least one of suite of computer 
programs according to the third aspect The computer readable storage medium may be 
any magnetic, optical, magneto-optical, solid-state, or other storage medium capable of 
5 being read by a computer. 

Brief Description of the Drawings 

Further features and advantages of the present invention will become apparent 
from the following description of an embodiment thereof, presented by way of example only, 
10 and by reference to the accompanying drawings, wherein like reference numerals refer to 
like parts, and wherein: 

Figure 1 is a system block diagram illustrating the components of the embodiment 
of the invention, and the signal flows therebetween; 

Figure 2 is a process flow diagram illustrating in more detail how information flows 
1 5 between the components of the embodiment of the invention in operation; 

Figure 3 is a flow diagram for an algorithm to detect characteristics of display 
objects in web content within the embodiment of the invention; 

Figure 4 is a decision tree to detect the functions of display objects in web content 
within the embodiment of the invention; 
20 Figure 5 is a flow diagram illustrating how content transformations can be applied 

in the embodiment of the invention; and 

Figure 6 is a flow diagram illustrating how the process of content splitting is 
performed in the embodiment of the invention. 

25 Description of the Embodiment 

An embodiment of the present invention will now be described with reference to 
Figures 1 to 6. 

Figure 1 is a system block diagram of the system provided by the embodiment of 
the invention. This system consists of 8 sub-components, as described next. The full 
30 operation of the system will be described later. 

Firstly there is provided the client capability discovery module 12. The purpose of 
this module is to discover the end user's device characteristics e.g. type of devices and their 
capabilities such as screen size/resolution supported, processing power etc., and as such 
this module receives information from the end user display device relating to its capabilities. 
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The client capability discovery module 12 passes the end user's device information to the 
Decision Module 14. 

The Decision module 14 contains existing Client Capabilities profile Ids which were 
previously detected or predefined by the adaptation system. Client Capabilities profile Ids 
5 are sets of information relating to display device display characteristics. In use the Decision 
module 14 first compares an end user's device characteristics and capabilities (CC) range 
based on the information sent by Discovery Module with the existing capability profiles. If 
the client capabilities match an existing profile, then the profile Id of the matching profile is 
sent to a content Cache 10 , in which is stored different versions of pre-generated adapted 

10 web content. If the received CC set does not match an existing CC range (i.e. there is no 
existing CC profile which matches the present requesting device capabilities) then the 
Adaptation module 16 will be triggered. Additionally, the adaptation module 16 may also be 
triggered manually to generate different versions of the original web content, without there 
being a specific request from an end-user. 

15 Howsoever the adaptation module 16 is triggered, the module 16 then acts to 

examine the http header of the requested web content, and further acts to control a content 
analysis module 20 to retrieve the requested web content from a web content source store 
22. The content analysis module 20 then acts to analyse the indicated web content from 
the content source store 22, and passes back to the adaptation module 16 a range of 

20 parameters relating to the characteristics of the web content as input values to the 
adaptation module 16. The input parameters received at the adaptation module 16 from the 
content analysis module 20 enable the adaptation module 16 to adapt the requested web 
content from the original web content stored in the web contents store 22. The output of the 
adaptation module 16 is therefore an adapted version of the originally requested web 

25 content, and this is sent in an appropriate mark up language such as html to a content 
cache 10 together with a set of client capability (cc) information, being a set of one or more 
characteristics of the display of the device requesting the web content. Such display 
characteristic information was determined by the client capability discovery module 12, as 
previously described. 

30 In addition to the modules mentioned above, there is also provided, as previously 

mentioned, a content cache 10 which acts to store different adapted versions of the original 
web content In some embodiments, the cache 10 may also store client capability 
characteristics, being the set of information relating to the display characteristics of different 
client display device. Also provided is the content source 22, in which the original web 

35 content to be adapted is stored, and a content tidy module 18 which acts under the control 
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of the content analysis module 20 to tidy up the original web content received from the 
content source store 22 prior to the analysis thereof. Furthermore, a customisation module 
24 is also provided which is merely a front end system providing previews of the adapted 
web content from the content cache. The customisation module 24 is an offline module 
5 which allows the author to preview and further customise adapted content. 

Having described the various system modules provided by the embodiment of the 
present invention, the operation of those modules will now be described in further detail with 
respect to Figures 2 to 6. 

The apparatus provided by the present invention would most likely be embodied in 
10 a computer system acting as a web server or the like. Alternatively, the apparatus can be 
embodied in a computer system, or the components (as shown in figure 1) can be 
embodied in separate computer systems but one computer system acts as a web server to 
other computer systems. Howsoever the apparatus is embodied, the computer system not 
only acts as a mere server, but also allows for the web content author to develop the web 
15 content and review it thereon. Moreover, the system also acts to generate different 
versions of the original web content for different intended display devices. In view of these 
functions, there are three distinct modes of operation of the system: a first mode wherein it 
is acting to service user requests for web content, the request having been received over a 
network; a second mode wherein it acts to generate adapted versions of the web content for 
20 different intended display devices in advance of the receipt requests from users for such 
web content; and a third mode wherein adaptation of web content to provide a further 
adapted version can be performed on the fly in response to a user request. Each of these 
modes of operation will now be described. 

Dealing with the second of the above described modes of operation first, imagine 
25 that the system is being used to generate different versions of original web content in 
advance of user requests therefor. For example, this mode could be used during website 
development to provide versions of an original source web content each especially adapted 
for different intended display devices, each with different display characteristics. 

Therefore, when in this mode the first step to be performed is that a plurality of sets 
30 of predefined intended display device display characteristic profiles are created, each set 
having a unique ID, and corresponding to a set of one or more display characteristics, each 
characteristic taking a range of values. For example, a first client capability profile set could 
have fields entitled clientjype screen_resolution, and colourjdepth. A first profile ID CC1 
would by way of example have the value "PC in the clientjype field, the value "800 x 600" 
35 or "1024 x 768" in the screen_resolution field, and "16bits n in the colour_depth field. As a 



200337 25 



30 SEP 2883 



7 



further example, a further client profile set with the ID "CC2" could have the value "PDA" .n 
the clientjype field, the value "200 x 300" in the screenjesolution field, and the value 
"32bit" in the colour_depth field. It will be understood that the above are merely non limiting 
examples of the type of information which can contribute towards the profile sets and that a 
5 large number of different profiles can be easily created by forming a collection with different 
combinations of device characteristics and capabilities. Once created, the client capability 
profile sets are stored in a profile server 26, as shown in Figure 2. This is merely a 
database system which acts to physically store the device profiles. The profile server 26 is 
accessible by the decision module 14 in order to allow the decision module to compare 
10 requesting user display device characteristics with the stored client capability profile sets. 

Having created a plurality of client capability profile sets each with a different 
combination of intended display device characteristics and values, in the mode of operation 
presently described the system acts to then generate an adapted version of the original 
source web content for each of the client capability profile sets stored in the profile server 
15 26. This is performed by triggering the adaptation module 16 to adapt the original source 
web content to match each client capability profile set The adaptation module 16 will 
usually be triggered separately for each client capability profile set, such that on any one 
triggering, a single adapted version of the web content corresponding to a single client 
capability profile set will be generated. The detailed operation of the adaptation module 16 
20 upon triggering is given below. 

The first step performed by the Adaptation Module 16 is to trigger the Content 
Analysis module 20, by passing to it details of the original source web content to be 
adapted. The Content Analysis module 20 then retrieves the original content source from 
the Content source store 22 (which stores all the web content source created by 
25 developers/authors), and passes the retrieved content source to the Content Tidying 
module 18 for conversion to an xHTML file. The function of the Content T.dying module 18 
is to tidy up the structure of the mark-up language (web content) and convert it into xHTML 
structure format xHTML format provides a neat and tidy structure for the Content Analysis 
module 20 to perform the analysis task. The Content Tidying module 18 can be provided by 
30 using 3* party software such as TIDY, available at the priority date from 
http://tidy.sourceforge.net/. As such, no further details of the operation of the content tidying 
module 18 will be given here. The Content Tidying module 18 passes back the tidied 
xHTML file to the Content Analysis module 20. 

Having received the tidied web content, the Content Analysis module 20 then 
35 performs the following tasks in sequential order 
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i) calculates the total and individual pixels and characters of display objects in the 
web content; 

ii) detects the functions of individual objects in the web content (normally they are 
the tags in a web page). For example, an object may possess a styling, structural or display 

5 tag; 

iii) groups single objects based on their structural behaviour (information from 

. object tag); and then 

iv) matches the display object display patterns, and groups them together to form a 
group (performed using a Pattern Matching algorithm). 

10 These four tasks are performed by respective dedicated algorithms, the details of 

which are given next. 

Concerning the first task, the purpose of this is to calculate the pixels/size of 
display objects such as text, image and etc. The algorithm which performs this task will first 
detect the type of display object. The algorithm then applies different analysis logic for 
15 different type of display objects. For example, if the display object is a text object, then it 
gets the length, font style and size and calculate the pixels based on these input. If the 
display object is an image/applet/object, then the algorithm will calculate total pixels based 
on width and height of the object. For the rest of the display objects, the algorithm will 
calculate total pixels based on the width, height and/or width/height attributes set in the 
20 parameter of the object (if it is specified in the HTML content). The exact steps performed 
by this algorithm are shown in Figure 3, and described next. 

Referring to Figure 3, the first step to be performed by the algorithm at step 3.2 is 
that it detects an individual display object within the tidied web content. Then, at step 3.4 an 
evaluation is made to determine whether or not the detected display object is text. If this 
25 evaluation returns positive, such that the detected display object is determined to be text, 
then at step 3.6 the length of every text string within the display object is obtained. Next, at 
step 3.8 a text tag is created for every string, and at step 3.10 the numbers of characters in 
the string determined at step 3.6 is set as an attribute of the text tag. 

Next, at step 3.12 the font and style of every text string is determined, and then at 
30 step 3.14 the size of every text string is also determined. Using this information, at step 
3.16 the height and width of the text string based on their font, style, and size attributes is 
calculated and these calculated height and width values are set as further attributes of the 
text tag for each string at step 3.18. The process for that particular display object which 
was determined to be text then ends at step 3.50, and the process starts once again at step 
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3.2 to detect the next display object in the web content Once all of the display objects have 
been processed by the algorithm, then the algorithm is not repeated. 

Returning to step 3.4, if it is determined herein that the detected display object is 
not text then a second evaluation is performed at step 3.20 to determine whether the 
5 detected display object is an image, applet, or object. If this evaluation returns positive, ,.e. 
that the display object is an image, applet, or object, then processing proceeds to step 3.22 
wherein a further evaluation is performed to determine whether or not the width of the 
image applet, or object is specified. If this is the case then processing proceeds to step 
3.24. If this is not the case, then processing proceeds to step 3.28, wherein the original 
1 0 width of the object is determined, and thereafter processing also proceeds to step 3.24. 

At step 3.24 a further evaluation is performed to determine whether or not the 
height of the detected image, applet or object is specified. If this evaluation returns positive 
then processing proceeds to step 3.26. On the contrary, if the evaluation of step 3.24 
returns a negative, then processing proceeds to step 3.30 wherein the original height of the 
1 5 object is determined. Processing then proceeds from step 3.30 to step 3.26. 

At step 3.26, the width and height attributes of the image, applet, or object as 
determined by the previously described steps are set as the width and height attributes of 
the object tag within the web content. Following this, the processing of that particular 
display object then ends at step 3.50. As before, if further objects need to be processed 
20 then processing begins again at step 3.2. 

Returning to step 3.20, if the evaluation performed therein determines that the 
detected display object is not an image, applet, or object, then processing proceeds to step 
3.32, wherein an evaluation is performed as to whether the width and height of the detected 
display object are specified. If this is the case, then processing proceeds to step 3.34 
25 wherein the specified width and height are set as parameters in the style attribute of each 
control tag for the detected display object. The processing then ends at step 3.50, and may 
be repeated if required as described previously. 

If at step 3.32 it is instead determined that neither the width nor the height are 
specified, then processing proceeds to a further evaluation at step 3.36, wherein it is 
30 determined whether or not the size of the detected display object is specified. If this is the 
case, then processing proceeds to step 3.34 wherein the size is set as a parameter in the 
style attribute of each control type for the object. Again, processing then proceeds to step 
3.50 where it ends, but may be repeated if there are further display objects to process. 

Finally, if the evaluation at step 3.36 returns a negative, then a final evaluation at 
35 step 3.38 is performing to determine whether or not a value of a detected display object is 
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specified and if so then the specified value is set as parameters in the style attribute of each 
control type for the object at step 3.34. On the contrary, if no such value is specified then 
processing proceeds to step 3.40 wherein a default width and height of each control is 
retrieved from memory, which are then set as default values at step 3.34. 

This algorithm therefore acts to determine for each display object within the ttd.ed 
web content size parameters such as the length of a text string, or the width and height of 
an image. This information may then be used in the adaptation process, to be described 
later 

Concerning the second task, a further algorithm is provided to perform this task, as 
> described next. 

First the algorithm to perform the second task pre-defines the function categones of 
single objects from a mark-up language perspective. A Single Object (O) is an element 
embedded in a mark-up language which carries properties of its own such as display styles, 
static or dynamic and structural styles. 
5 We define the following pre-defined categories: 

Information (I) 
Information Title (T) 
Control (C) 
Decoration (D) 
0 Replaceable Navigator (RN) 

Un-replaceable Navigator (UN) 
Replaceable Navigator Title (RNT) 
Un-replaceable Navigator Title (UNT) 
The definitions of these function categories are as follows: 
>5 Information (I) - an object that provides informative displayed content, which is 

important and cannot be replaceable. This object can be text, image, video, audio or any 
object (such as JAVA applet) file. 

Information Title (T) - an object that describes the information object, which can be 
the text header or image with information properties. 
30 Control (C) - an object that is meant for user interactive purposes, such as a button 

(radio or submit), input text area, form, drop down menu, check box, list box etc. 

Decoration (D) - an object that does not play an informative role but is solely for 
improving the effect of visualization. This object can be image or text. 
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Replaceable Navigator (RN) - a Navigator is a URI link object. A Replaceable 
Navigator is a Navigator object that can be replaced by alternative text. It must be an image 
provided with alternative text. 

Un-replaceable Navigator (UN) - as mentioned, a Navigator is a URI link object. 
5 An Un-replaceable Navigator is therefore a Navigator object that cannot be replaced by 
alternative text. It might be text or image without alternative text. 

Replaceable Navigator Title (RNT) - A replaceable navigator title is the informative 
URIIink object which describes a Navigator object. It can be replaced by alternative text. It 
must be an image provided with alternative text 
10 Un-replaceable Navigator Title (UNT) - This is an informative URI link object wh.ch 

describes a Navigator object It cannot be replaced by alternative text. It might be text or 

image without alternative text. 

By providing such pre-defined function categories, the algorithm starts a scann.ng 
and comparing mechanism that analyses the properties of single objects embedded in a 
15 maric-up language (such as HTML). The reasoning of the analysis is based on a decs.on 
tree (scanning and comparison logic sequence), as shown in Figure 4. 

The algorithm begins by scanning the web content mark-up language from top to 
bottom When the scanning process starts, every single object of the mark-up language ,s 
searched, detected and compared with the pre-defined function categories. This companng 
20 process is carried out until end of the mark-up language. 

Within the scanning loop, the algorithm searches for single objects and determines 
their function based on the properties carried by the object. The algorithm stops companng 
the single object (On) properties and searches for the next single object (On + f ) after the 
first single object On has qualified for a particular function category. 
25 Referring to Figure 4, the decision tree process applied by the algorithm .s as 

follows The algorithm starts by first searching for a single object. Once a single object 40 
has been detected, the algorithm then checks at step 4.2 as to whether the detected object 
has hyperlink properties embedded therein. 

If the object has hyperlink properties, then a check is performed at step 4.4 to 
30 determine if the object is replaceable by finding whether there is any alternative text for the 
object If there is alternative text and title properties for this object (as determined by step 
4 6) then this object is categorised as a Replaceable Navigator Title (RNT) 48. Else if there 
is alternative text but no title properties for this object, then the algorithm categorises th,s 
object as a Replaceable Navigator (RN) 46. 
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If there is no aiternative text and title properties for this object, then the algonthm 
categorises this object as an Un-rep«aceab.e Navigator Title (UNT) 52. Else if there is no 
alternative text and no title properties for this object, then the algorithm categorises th.s 
object as an Un-replaceable Navigator (URN) 50. This distinction is evaluated at step 4.16. 
5 The title properties of RNT and UNT are based on the following conditions: 
It has title header properties; and 
It is a display object; and 
It must be URI hyperlink (image or text); and 
It has different styles compared to its adjacent display object. 
10 After comparing the object with the hyperlink properties at step 4.2, 4.4, 4.6, and 

4 16 if the object has not yet been categorised the invention will route the checking log.c for 
non-hyperlink properties. User side interaction properties are the next to be compared. 
The factors that determine if the single object has user side interaction properties are ,f the 
single object is one of the following: button (radio or submit), input text area, form, drop 
15 down menu, check box, or list box, and an evaluation to this effect is made at step 4.8 

If the single object is detected at step 4.8 as having user side interaction 
properties, it will be categorized as a Control (C) 42. Else, the algorithm will further 
compare if it is an object which carries video, class object or audio properties, at step 4.10. 
If it is then this single object will be included in the Information (I) function category 44. 
20 ' If the object has still not been categorised, the algorithm further checks the s.ngle 
object by determining there are decoration properties carried by the single object at step 
4 12 The decoration properties are determined based on the following critena: 

The size of the single object - The size of the single object is derived from an 
experimental value which best represents the size of decoration properties; or 
25 The presence of symbols, lines and separators between the present single object 

(On) and the next single object (On+1 ). 

The object size (width & height) is based on experimental value (subjective value). 
The inventors performed experimental tests on 100 web pages, and our results showed that 
images with pixel sizes width <= 20 and height <=20 tended to be a decoration object. 
30 If the present single object qualifies from the above conditions, .t w,ll be 

categorized as a Decoration (D) function 54. If there are no decoration properties found 
within the single object, the invention will further check for information title properties, at 
step 4.14. 
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Once the single object is determined as not having decoration properties, it will be 
either categorized as Information function or Information Title function at step 4.14. The 
single object will only be qualified as Information Title (IT) 58 based on the following criteria: 
It has title header properties; and 
5 It is a display object; and 

It might be text or image only; and 
It has different styles compare to its adjacent display object. 
If the single object is determined not to have the title properties, it will be 
categorized as an Information (I) function 56. 
1 o Therefore, as will be apparent from the above, based on the scanning process and 

comparing mechanism done by this algorithm, all of the single objects in the tidied web 
content obtain an assigned specific function to represent their role within the mark-up 
language, thus fulfilling the second task performed by the content analysis module 20. 

Regarding the third task, a further algorithm is provided within the content analysis 
15 module to perform this task. The main purpose of this third algorithm is to group content into 
clusters based on their positioning information. Structural tags represent this information. 
The structural tags we recognize and select are: 

<TABLE>, <FORM>, <FRAMESET>, <DIV>, <UL>, <OL>, <DL>, <P>, <PRE>, 
<ADDRESS>, <BLOCKQUOTE>, <Hn>, <HR>, <CENTER>, <MENU>, <DIR>, 
20 <TD> and <NOSCRIPT>; 

which were selected for clustering objects because they are able to group objects 
together visually when the objects are displayed on client browsers. 

The operation of the algorithm which performs this task is simple, and merely acts 
to parse the web content and select objects for grouping on the basis of the presence of any 
25 of the above tags within the object 

With respect to the fourth task, i.e. that of matching the display object display 
patterns, and grouping them together to form a group, a pattern matching algorithm is 
provided, as described next. 

Web pages can be thought of as comprising of a number of content chunks. These 
30 chunks are sets of multimedia objects that relate to particular areas of interest or tasks. If a 
basic object is defined as one that contains a single multimedia element (for example an 
image or a body of text), and a composite object is defined as a set of objects (basic or 
composite) that perform some certain functions together, then a chunk is itself a high-level 
composite object When a web-page is split up into a number of smaller pages it is 
35 important for the intelligibility that the content chunks are not broken up. Thus, before 
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adapting the content, the multimedia objects that make up the page need to be grouped into 
potential chunks. 

Yang and Zhang of Microsoft Research have described a system for locating such 
content chunks, in Yang, Y. and Zhang, H.J. "HTML Page Analysis Based on Visual Cues" 

5 in 6 th International Conference on Document Analysis and Recognition (ICDAR2001), 2001. 
The following paragraphs outline a similar system. Both systems use the HTML tags to 
perform an initial grouping of multimedia objects into possible composite objects, followed 
by application of pattern matching to find possible further groupings. The difference 
between the systems lies in the distance measure used to determine the sim.lanty of 

1 0 various objects and the algorithm for pattern matching. 

Initial grouping of objects 

Before performing the initial grouping of multimedia objects into possible composite 
objects, the HTML document is parsed into an xHTML tree to clean up the HTML tags and 
15 to form an easy to manipulate structure. The xHTML tree consists of HTML tags at the 
nodes and multimedia objects at the leaves. 

The next step involves the construction of a group tree, in which the leaves contain 
multimedia objects and nodes denote composite objects (and so potential content chunks), 
up to the top node which denotes the entire web-page. The xHTML tree is transformed into 
20 a group tree, by first inserting <g> tags directly above a predefined set of HTML tags 
associated with the natural breaks in the content, mainly block level tags, such as <table>, 
<td>, <form>, <center> and <h>. Second, a set of tokens, one for each type of multimedia 
object is defined along with sets of attributes, for example, the number of characters in the 
text string, or the width and height of the image. Third, working from the multimedia objects 
25 at the leaves in the tree, the tokens are passed upwards and all nodes other than the leaf 
and those containing <g> tags are removed. As a token is passed upwards it accumulates 
attributes associated with the nodes, if a node has more than one child then all the children 
receive the attribute associated with it Some formatting tags, such as <tr>, are ignored 
since they to not impose any attributes onto the multimedia elements and unlike the tags in 
30 the predefined set are not usually indicative of a new content chunk. If a <g> tag node has 
more than one child then the tokens arranged in a linear list in the same left-to-right order in 
which the child nodes are arranged. 

By labelling the objects associated with various block-level tags, such as tables 
and cells, as potential groups; the group tree already incorporates the majority of the 
composite objects and so content chunks. This technique assumes, not always correctly, 
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that the <g> tags do. not split any content chunks. However, labelling the contents of 
formatting objects does not distinguish between content chunks which are implied through 
repeated arrangements of similar multimedia objects. Thus, once the group tree has been 
derived, pattern matching is performed on the list of tokens belonging to the child nodes of 
5 each <g> node. 

Pattern Matching 

The first step in the pattern matching process is determining which of the lists of 
tokens in each of the child nodes are similar. Note that each token has a set of attributes 
10 associated with it. Each attribute consists of a type and a value pair, for example (font, 14pt) 
and (width, 100). The values can either be strings or integers. If an attribute type does not 
naturally have a value associated with it then the value is set to a null string, for example 
(bold, ). When comparing tokens, if a particular token does not have any attributes 
associated with it, then it is assigned a special null attribute ( , ) to ensure that the set of 
1 5 attributes is not empty. 

To compare two tokens, a and (3, with the sets of attributes: (t°,V?), i = l...,N a 

and (r/,F/), j = i,...,N p the following similarity measure is used 

where i) (r^)(r',F*)=l if 3 l*j*N» such that T? =Tf and V? =Vf 
20 vfc,rr)fc'.V')-i^.rf)/nm*fK.rf) if 3 l<y ^ such that T? =Tf and both of 
V" and vf are integers 
iii) fc a , v, a Xr" , V p )= 0 otherwise. 

Comparison of lists of tokens is achieved by dynamic time warping (a dynamic 
25 programming algorithm), see table 1 below, in which the alignment path is not allowed to 
wander more than a given number (proportional to the length of the smallest token) places 
off the diagonal and also incorporates a punishment for non-diagonal movements. If the 
sum of the similarity measures along the alignment path is greater than a threshold the two 
lists of tokens are regarded as similar since they are either identical or if there is only a little 
30 variation in their length and composition. 



16 



public boolean Compare (ArrayList A, ArrayList B) 

koat MUtl = new "° atlA - S ^ii + aleragf gasper token 
6 fiT;;« P »« nominal 

for (x=l;x<=A.size() ;x++) 

for (y=l;y<=B.size() ;y++) 

+ s(A.get(x-l) ,B.get(y-D) ; 

15 iI } « Mti .si Z e ( )nB.si,e„]>Min« A . 3 i Z eO.B.si,eO). Allow, 

return true; 
else return false; 

Table 1. JAVA code for the comparison of lists of tokens. 

To detect patterns, a lower trtanguiar matrix, minus the diagona. elements, is first 

namely: 

The pattern must be at least two child nodes in length; and 
The pattern must be repeated at least twice; and 
on Instances of the same pattern should not overlap. 

35 ~^=:=^-— =~= 

ntSS to b e passed to the Adaptation Module. The information which 

has been provided by this process relate to: 

0 Unbreakable groups which are no. supposed to be separated during adaptation 
(i.e. the "chunks" referred to above); 
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D to functtons of groups and single objects, which indicate whether they can be 

the Adaption module when dedding whether/how ,o sp.it the content ,nto smaller pages, 

^ iv) The original structural and styling information of the content source. Here, 
structure means the layout of content Structural information contains codes of how the 
col. ,s arranged and pbsttioned. Style means content objects- (such as text or ,mage) 
width, height colour, font attributes (e.g. font-face and size), etc. 

The format of the XML tree which has been produced by We process . such mat t 
now includes idemiflers (known as <g> tags) which identify where it is allowable to spirt the 
content for display on separate web pages. Mulitple <g> tags have been inserted, as 
discussed earlier, based both on natural breaks infhe content (eg block M tags su 
<teble> <td>,<form>,etc) end also based on appropriate groups of content wh,ch should be 
15 displayed together. The hierarchical tree format means that these <g> tags form a number 
o, nodes indicating where the tree may be split, and therefore me lowest <g> tags (.e mose 
closest to me leaves) indicate mose nodes below which no content splitting <s allowed (,e 
unbreakable groups of content which cannot be spread over different web pages) 

An additional aspect of me XML tree is mat it includes a second type of Mk 
20 These identifiers are labels associeted eimer wim individual content objects (ie single 
multimedia elements such as an image or body of text) or composite objects (a set o 
objects which perform certain functions together). Those objects which are sun liar, for 
example of the same type and wim similar lists of style attributes and other charactensttcs 
(such as image size or number of text characters) are labelled wim the same identifier tag. 
25 During me adaptation process (described tater) these labels an, used to ensure unrform 
treatment of similar objects when they are being processed for display on a web page (eg 
mat all similar images / text boxes are reduced by me same proportton). 

As mentioned, me Content Analysis module 20 now passes me results of me 
analysis ( me XML tree) to me Adaptation module 16. The Adaptation module 16 men 
30 removes all me client capability device profiles available (and which in this mode of 
operation were pre-generated, from me Profit Server 26. The Adaptation module 6 then 
triggers loops which run an algorithm to generate different versions of web content based 
on me profiles available. The number of loops performed will depend on me number o 
profiles available. Essentially, an adapted version of me content is generated for each ckent 
35 capability profile. 
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The algorithms that operate on the XML tree to adapt the content are illustrated in 
the flow charts in Figures 5 and 6. Figure 5 shows the algorithm which checks whether the 
content will fit into the current profile range for the display device. This algorithm cycles 
through a number of stages, checking each time whether the content will fit the page (.e the 
5 display device), and if not then it performs a number of transformations to progress.vely 
reduce the size of the content The algorithm of Figure 5 forms just one of the stages of the 
algorithm shown in Figure 6, which further operates to split the content onto different pages 
(if necessary). 

Starting with the algorithm of Figure 6, the purpose of this is to ensure that «f the 
10 content is too large for display on a single page on the intended display device, then ,t ,s 
split into multiple smaller pages, whilst minimizing the amount of white space on the pages. 
To perform the splitting, the algorithm uses the <g> tags in the XML tree structure. The 
algorithm operates by moving through the tree from node to node (the nodes tagged wrth a 
<g> label) starting from the top <g> node. At each node, the algorithm calculates whether 
15 the content in the sub-trees below it will fit the display, applying transformations to the 
multimedia objects (such as shrinking the font size / image size) as appropriate. If the 
content below the current <g> node will not fit the display, then the algorithm moves down 
to the child <g> nodes and recalculates for each of those (splitting them further .f 
necessary) until the whole web page has been output as multiple smaller web pages. 
20 In more detail, with reference to Figure 6, the steps carried out by the algorithm are 

as follows. Two temporary stores, stacks "Q" and T, are used to temporarily store the 
nodes during processing. A third store, array Trans", is used to store data relating to the 
transformations (such as reduction of the font size, or image size) which are to be applied to 
the various objects in the web page. 
25 The adaptation algorithm commences at step S6.1. Firstly, the algorithm takes the 

top <g> node from the tree and puts it into stack Q (step S6.2). The algorithm also ensures 
that the temporary stack T is cleared (step S6.3). The algorithm then moves to step S6.4, 
which calls the function frts_page(T,Trans), explained in more detail below, to check 
whether the <g> nodes currently stored in T will fit into the client display, and appl.es 
30 transformations to the multimedia objects as appropriate. However, since at this point the 
stack T is empty, fits _page(JJrans) will return TRUE, and processing moves onto step 
S6 5 which is to check whether all the nodes in T have the same parent, using the funcfion 
are_siblings(T). Since T is also empty, this returns TRUE, moving processing onto step 
S6.6, which checks whether there are any nodes remaining in stack Q. Since there are 
35 nodes in Q (the top <g> node is currently in Q). processing moves onto step S6.7 which 
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moves the node from the top of stack Q onto stack T, and processing cycles back to step 



S6.4. 



At this point, the top <g> node of the tree is in stack T, so step S6.4 checks us.ng 
function fits_page(T,Trans) whether this node (ie the web content of all the sub-trees below 
5 it) fit into the client device display, including applying transformations to the objects as 
appropriate. This step involves the algorithm illustrated in Figure 5. At this stage, the 
algorithm is checking whether the entire web page content will fit into the display dev.ce 
(because the node currently stored in T is the top node). If the content is small enough to 
fit the algorithm will terminate by passing through step S6.5 to step S6.6 since there are no 
10 nodes remaining in stack Q, the process jumps to step S6.15 where it outputs the enfire 
tree (using a call to the function render(T,Trans) ), and then ends. 

However, when the entire web page content will not fit into the display device, then 
step S6 4 will be negative, and processing moves to step S6.8 which removes the top node 
from stack T and stores it as node N. Then, at step S6.9, there are still no nodes in stack T, 
15 so processing moves to step S6.10 which checks whether the node N is a leaf. At th.s 
stage, the answer is negative (because the node N is the top <g> node), so processing 
moves to step S6.11 which places all the direct child <g> nodes of node N onto stack Q for 
processing. Then, step S6.12 checks whether there are any nodes for processing in stack 
Q. At this stage, all the direct child <g> nodes of the top XML tree node are in stack Q, so 
20 processing cycles back to step S6.3 which clears the temporary stack T. 

Processing then moves through steps S6.4, S6.5, S6.6 and S6.7 which moves the 
first of the direct child nodes from stack Q to stack T. It may be this portion of the web 
content (ie the content in the sub-trees below this child node) is small enough to fit into the 
client display (ie step S6.4 returns TRUE). In this case processing again passes through 
25 steps S6.4, S6.5, S6.6 and S6.7 to add the second child node from stack Q to stack T. This 
process will repeat indefinitely, checking each time whether this combined portion of the 
content (ie of all the nodes added to stack T) still fits within the client display. If at any point 
the source content (ie the current portion of content represented by the list of nodes in T) 
will not fit the display (ie step S6.4 is negative) then processing moves to step S6.8 where 
30 the last node added to stack T (ie the node highest on the stack) is removed from stack T 
and stored as node N. Step S6.9 then checks whether there are still nodes remaining on 
stack T, and, if yes, processing moves to step S6.13 where the node N is returned to the 
store it came from (ie stack Q) for processing later. Then a section of the XML tree (ie the 
content represented by the nodes currenly in stack T) is output as one of the adapted 
35 smaller pages using a call to the function render(T,Trans). The algorithm continues with 
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S6.12 determining whether there are still nodes remaining in stack Q tor processing, and, if 

ves, moves to step S6.3. tv>rn , tn h 
Following the steps illustrated in Figure 6, the algorithm will Iterahvely wo* through 
the entire tree, transforming the content to see if it can be made to At the display (ie » 
5 function (*J**T.T>-*> and if not men splitting me content even further. This ,s 
achieved in me algorithm by me process ot each time me content below a node does not fit 
into me display device then its direct child <g> nodes are separately added into a store (the 
qU eue in stack Q) in step S6.11 for processing later. This a* to split me content ,nto 
smaller and smaller portions, so that me sub-trees can men be checked individually to see # 
10 me content is a suitable size for display on me client device. For any content portions that 
do fit the display, an attempt is men made to combine mem with other portions of the 
content (ie step S6.7) so as to minimize the amount of white space in me display. However, 
mis combining of different portions of me content is only made if the nodes in question are 
siblings (i.e. have me same parent <g> node In the tree). In this way, me adaptation 
15 algorithm works its way down through the entire tree, from me top <g> node to me bottom 
<g> node leaves (below which no further content splitting is allowed). 

By virtue of hierarchical tree arrangement of me web content in the XML tree, and 
the order of me steps performed by the adaptation algorithm, me top-to-bottom, left-to-nght 
order of me content is maintained when it is displayed. Also, it ensures mat only whole and 
20 related composite objects (of me same level of grouping) are displayed on a angle page. 

In addition, the algorithm ensures consistency of display of simlar objects 
throughout me smaller pages. It achieves mis by the use of a store (eg me array called 
Trans) which maintains a list of transformations which have been applied to the objects, 
together with the object label associated with that object (ie me second type of identifier tag 
25 referred to earlier which indicates similar objects). For each portion of content, me 
algorithm checks array Trans to see if changes have already been applied to objects wh,ch 
were similar, so as to ensure that me transformations applied to these objects are 
consistent The array Trans is dynamically updated during processing, each time the 
function msjageO successfully applies new transformations to a portion of content mat 
30 result in it fitting within me page size. 

It is to be understood mat throughout me description of the adaptation algonthm 
above, reference has been made to two stores, stacks "Q" and T. used to temporarily 
store the nodes during processing. For the purposes of the function of the algorithm, such 
stores can therefore be considered to hold the relevant portions of content associated with 
35 those nodes. This is true notwithstanding me fact that the stores themselves may actually 
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be rented simpiy as a »st c, memo* addles which point to the iocations of ft. 
content in another part of the memory. 

An exempt embodiment o, fte adapted aigodftm (pseudo ,ava oode) tor 

adapting the web page is given below: 

5 

splitPage (top_node) 

{ ^ 0 // Qi 5 astackwhichholdsthenodestobeprocessed 

Sta ^ // Tisastackwhichholdsnodesbeingprocessed 

^ « ' // N is a temporary node 

10 Node H, // ^i^^y^chholdsthetransformauonsappliedto 

ArrayLxst Trans, // ^ Qrd J g to ^ object/gr oup labelling 

Q.push(top_node) // Add the top <g> node in the XML tree to stack Q 
15 while (Q.size()>0) 

while (.T.emptyO) T.popO; // EmptystackT 
while (fits_j>age(T, Trans) && are_siblings <T) ) 

20 if (Q..i»()-0) goto end; // Finished XML free 

11 lu „ // Take the top node off stack Q... 

N=Q.pop(); ...andadditontostackT 

T.push(N) ; " 
} 

25 NT pop() • // Remove top node fxom stack T and store it as node N 

if (T.size<)>0) // CheckswhetherthereareanynodesinstackT 

* // node N is returned to stack Q for processing later 
30 ? r Ss=render(T,Trans); / / output a section of the XML page 
} 

else 

{ vx ^ e r(H Trans)- //If the current node N is a leaf 

35 if <is_leafC*» Trans^endertH,^)^//^^^^^^ 

else 

L each child C of N in ^ cMld <g> nodesofN 

40 {Q.push(C) ; } // oQto stackQ forprocessing 

} 

} 

> v // output the last part of the XML page 

45 end: render (T, Trans) ; " ou * 

} 
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where the following functions are used by SplitPageO: 
is leaf(N) 

5 This function returns TRUE if the temporary node N is a leaf 

are siblinas(T) 

This function returns TRUE if the nodes in stack T share the same parent <g> node 
(or T contains a single node or is empty) 

10 

fits DaaeCT.Trans) 

This function determines whether the nodes in stack T (ie the content of all the sub- 
trees below those nodes) are small enough to fit into the display device. Firstly, if stack T 
contains any nodes with identifier labels the same as those in Trans, then the associated 
15 transformations are applied to those objects. If the content now fits the page then 
ffts_page(T,Trans) returns TRUE. .,„„„„, 
However, if the content does not yet fit the page, then a number of different 
transformations can be applied to the content and these are illustrated in Figure 5. If any of 
these additional object transformations successfully result in the content fitting the display 
20 device then the transformations are added to the array Trans with the appropriate object 
labels, and ffls_page(T, Trans) returns TRUE. 

Alternatively, if the source content will still not fit the display device, even after all 
possible additional transformations have been applied, then ffls_page(T. Trans) w,ll return 
FALSE. 

25 This function will also return TRUE if stack T is empty. 

randerlN.Tn>ns\. render /T.Transi 

This function is used to output a portion of the content (ie the content represented 
by one or more nodes of the XML tree) as one of the adapted smaller pages. The function 
30 either has input parameters of node N and array Trans, or stack T and array Trans. Srnce 
Trans contains the identifier labels for object which have been rendered to date, along With 
those transformations and parameters associated with them, then if N or T contains objects 
whose labels are the same as those in Trans, the relevant transformations are applied to 
those objects. The remaining objects are then transformed as appropriate (ie to fit the 
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intended display device whilst minimizing the white space), and the labels and 
transformations added to Trans. Finally Trans Is returned by the function. 

As mentioned eariier, during the function call rTfcpagefT, Trans), if the content 
5 does not fit *e profile range (eg the intended display device), then a number of different 
transformations can be applied to the content, and these are now discussed ,n more data,l 
witt, reference to Figure 5. The algorithm which carries this out performs » 
checks: i) Check if the total pixels and characters of the source content can be fit ,nto the 

profile range; . . 

10 ii) Check if after removing the blank spaces and lines, source content can be fit into 

the profile range; and 

Hi) Check if removing, resizing, summarizing, and changing propert.es of the 

display object will fit the source content into the profile. 

These checks are embodied by 8 possible transformations. These transformations 
15 are applied in order, but after each transform has been applied an evaluation is performed 

to determined if the content as transformed up to that point can be displayed on the 

intended device, by referring to the client capability profile for that device. If it is determined 

that display is possible then no further transformations are applied, otherwise all 8 

transformations are applied. 
20 The available transformations, in order, are as follows:- 

1- transformation: Font reduction. Here the original font is transformed into a smaller font 

size with "verdana" as the font-family. 

2" d transformation: Image reduction. The purpose is to reduce image objects by 10/o and 
goes into recursion until it reaches the optimum size or 50%. 
25 3- transformation: Control object reduction. The purpose is to reduce objects based on the 
ratio of default screen size and client device if the result is greater than an optimum size of 
the object. 

4* transformation: Space removal. The purpose is to get rid of those unnecessary space 
between paragraphs. 

30 5* transformation: Line removal. Its purpose is the same as the 4 transformation. 

6 th transformation. Decoration image removal. The purpose is to remove images which 
have decoration properties based on objects' size. 

7- transformation: Decoration text removal. The purpose is to remove redundant texts 
which act as decoration if they are special characters. 
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8 m transformation: Image replacement. If there is an alternate text for an image, then the 
algorithm will compare the alternate text size with the image itself. The shorter will be 
selected as the adapted result. 

Figure 5 illustrates the transformation algorithm in more detail, and in particular 
5 illustrates the eight different transforms which may be applied. Referring to Figure 5, the 
procedure provided thereby is started at step 5.1 wherein two counters are initialised. More 
particularly, a first counter i is initialised to 1=1 . and a second counter r is initialised to r=0. 

Next, processing proceeds to step 5.2, wherein an evaluation is performed to 
determine whether or not the web content will fit into the display of the intended display 
10 device. This evaluation is performed by comparing the characteristics of the content with 
the client device display capability characteristics as provided in the client capability 
profiles in the profile server 26. To generate a particular adapted version, at step 5.2 the 
evaluation is always performed against a single one of the client capability profiles, in 
respect of which an adapted version is being generated by the present instantiation of the 
15 adaptation algorithm. 

If the evaluation at step 5.2 indicates that the web content can fit into the display of 
the intended display device, and no transformation is required, then the transformation 
algorithm ends at step 5.3 and return TRUE for the function ffis_page().. 

On the contrary, if the evaluation at step 5.2 returns a negative, then processing 
20 proceeds to step 5.4 wherein an evaluation is performed to determine as to whether the 
counter i=1. It will be recalled here that when the algorithm is first started at step 5.1 the 
counter I is initialised to one, and hence the evaluation at step 5.4 returns a positive, and 
processing proceeds to step 5.6. Here, the first transformation in the form of font reduction 
is started, which takes the forms of steps 5.8 and 5.10. 
25 At step 5.8 the font size for all text in the web content to be adapted is set as 1, 

although in other embodiment other values may be chosen. Next, at step 5.10 the font 
typeface for all text in the web content is set as "verdana". These steps have the result of 
drastically reducing the size of any text objects in the web content. Following the steps 
processing proceeds to step 5.12, wherein the counter i is incremented by one, and then 
30 processing proceeds back to the evaluation at step 5.2 wherein an evaluation is performed 
to determine whether or not the transformed web content will now fit into the display of the 
intended display device. If this evaluation at step 5.2 returns a positive, i.e. the web content 
is now capable of being displayed on the intended display device, then the process 
proceeds to step 5.3 and ends. On the contrary, if further transformations are required then 
35 processing proceeds to step 5.4, wherein an evaluation is made as to whether the counter i 
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is equal to one. Here, as the count i was incremented at step 5.12 and is now equal to two, 
a negative result is returned and hence processing proceeds to the evaluation at step 5.14, 
which evaluates whether the counter i is equal to two. Here a positive value will be 
returned, whereupon processing will proceed to step 5.16. 

At step 5.16 an image reduction transformation is commenced. Within the image 
reduction transformation, first at step 5.18 a maximum possible reduction of the image is 
obtained. This is a hard coded value, for example 50%. Next, at step 5.20 an evaluation is 
made as to whether or not the maximum reduction value is greater than ten times the value 
of the counter r. It will be recalled here that at step 5.1 the value of the counter r was 
initialised at to zero, and hence on the first recursion the evaluation of step 5.20 will return a 
positive value. Here processing proceeds to step 5.22, wherein images Within the web 
content are reduced by 10%. Processing then proceeds to step 5.24 wherein the counter r 
is incremented by one, and from there to step 5.26 wherein an evaluation is made as to 
whether or not r is equal to 5. On the first recursion r will have been incremented at step 
5.24 to take the value 1 only, and hence the evaluation of step 5.26 will return a negative 
value. In this case processing proceeds directly back to step 5.2, wherein the evaluation as 
to whether or not the transformed content will fit into the display of the intended display 
device is undertaken. If this is the case then processing ends at step 5.3 although if it is not 
the case then processing proceeds via step 5.4 to step 5.14, wherein, because i has not yet 
been incremented again, a positive evaluation is returned, and the image reduction 
transformation of steps 5.18, 5.20, 5.22, 5.24 and 5.26 is applied once again. 

It will be seen from Figure 5 that the image reduction transformation can be applied 
up to five times, and each time it is applied the images are further reduced in size by 10% 
for each recursion, or by the maximum reduction value available, in the event that the 
maximum reduction available of the image is not greater than ten times the present value of 
r. In either event, however, the transformation is recursively applied five times, until the 
counter r =5. In this case, the evaluation of step 5.6 will then return a positive, whereupon 
processing will proceed to step 5.12, wherein the counter i is incremented. From step 5.12 
processing always proceeds back to step 5.2, wherein the evaluation as to whether or not 
the content will now fit into the display of the intended display device is undertaken. If the 
transformations already applied are sufficient, then this evaluation will return a positive 
value and processing will end at step 5.3. If the transformations already applied are not 
sufficient, however, and further transformations are required, then processing will proceed 
via step 5.4 and now also via step 5.15 (by virtue of i now being equal to 3) to step 5.30. 
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Here, an evaluation is made as to whether or not the counter i equals 3, and if so 
processing proceeds to step 5.32, wherein the control object reduction transformation is 
commenced by proceeding to step 5.34. 

At step 5.34 a ratio is obtained of the default screen size for the web content, and 

5 the actual screen size of the intended display device. Based on this ratio, at step 5.36 a 
size of each control object is calculated based on the ratio, by applying the ratio to the 
default size. Then, at step 5.38 an evaluation is performed at to whether or not the 
calculated size for each control object is less than the minimum allowable size for each 
object, and if not processing proceeds to step 5.42 wherein the control object sizes can be 

10 reduced based on the calculated ratio. If, however, the calculated size is less than the 
allowable minimum size of each control object, then processing proceeds to step 5.40, 
wherein the size of the control objects in the web content is reduced based on the allowable 
minimum size. The allowable minimum size is predetermined in advance. 

After either step 5.40 or step 5.42, processing proceeds to step 5.12 wherein the 

15 counter i is incremented, and thereafter to step 5.2 wherein the evaluation as to whether or 
not the transformed content will now fit into the display of the intended display device is 
performed. 

Assuming the evaluation of step 5.2 returns a negative, the counter i is now equal 
to the value 4, and hence processing proceeds via step 5.4, step 5.14, and step 5.30, to 
20 step 5.44 wherein the evaluation that i equals 4 returns a positive value. This has the result 
of causing processing to proceed to step 5.46, wherein the space removal transformation is 
commenced. 

This transformation relates to looking at object tags within the web content, and 
removing those objects which have particular tags and/or which meet other certain 

25 conditions. Therefore, at step 5.48, those objects which have tag <BR> and which are the 
first child and the last child of objects with tags <TD> and <DIV> are removed. Next, at step 
5.50, those objects with tag <BR> and which are the sibling of objects with tag <Table> are 
also removed, and then, at step 5.52 any continuous blank spaces within the web content 
display objects are reduced to a single space, and correspondingly, at step 5.54 any 

30 continuous breaks within the web content display objects are reduced to one. Finally, at 
step 5.56 the cell padding, and cell spacing values of any <table> objects are reduced to 
zero. The result of the space removal transformation is to reduce blank space in the web 
content to an absolute minimum. 

Following step 5.56 processing proceeds to step 5.12, wherein the counter i is 

35 incremented to 5. The evaluation at step 5.2 is then performed to determine whether or not 
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the transformed content will now fit into the display of the intended display device, and if so 
processing then ends at step 5.3. If not, however, processing would proceed via step 5.4, 
step 5.14, step 5.30 and step 5.44, to step 5.58, wherein the evaluation that i is equal to 5 
would return a positive. Thereafter at step 5.60 the line removal transformation is applied, 
5 which acts at step 5.62 to remove all display objects with a <HR> tag. This has essentially 
the same function as the fourth transformation previously applied, i.e. to reduce blank 
space. 

After step 5.62, processing proceeds to step 5.12 once again, wherein the counter i 
is incremented. The evaluation of step 5.2 is then performed once again, and assuming 
10 that it produces a negative result processing will proceed to step 5.64 via steps 5.4, 5.14, 
5.30, 5.44, and 5.58. The evaluation at step 5.64 will result in a positive result, as i has 
been incremented to 6. 

Therefore, following step 5.64 processing proceeds to step 5.66, wherein the 
decoration text removal transformation is commenced. This is performed at step 5.68, 
15 wherein text which had its function detected by the content analysis module 20 as being for 
decorative purposes is removed from the web content 

Following step 5.68 processing proceeds to step 5.12 wherein the counter i is 
incremented, and thereafter to the evaluation at step 5.2 as to whether or not the 
transformed content will now fit into the display of the intended display device. Assuming 
20 this is not the case, processing proceeds by the respective evaluations of steps 5.4, 5.14, 
5.30, 5.44, 5-58, and 5.64 to step 5.70, and therein as the counter i now has a value of 7 a 
positive result is returned. This causes processing to proceed to step 5.72, wherein the 
decoration image removal transformation is commenced. At step 5.74 those images or 
objects whose function was detected by the content analysis module 20 as been decoration 
25 are removed. Thus images which do not contribute to the real semantic content of the web 
content are removed. 

Following step 5.74, processing proceeds once again to step 5.12 wherein the 
counter i is incremented. Thereafter the evaluation at step 5.2 is performed as to whether 
or not the now transformed content will fit into the display of the intended display device, 
30 and assuming that this evaluation returns a negative value, processing proceeds via the 
respective evaluations of step 5.4, step 5.14, step 5.30, step 5.44, step 5.58, step 5.64, and 
step 5.70 to step 5.76, wherein an evaluation is performed as to whether i is now equal to 8. 
As this evaluation will return a positive result, processing proceeds to step 5.78 wherein the 
image replacement transformation is commenced. This starts at step 5.80, wherein, for 
35 each image display object an evaluation is performed as to whether or not the image has 
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alternative text. If this is the case, then processing proceeds to step 5.82 wherein a further 
evaluation is performed as to whether or not the total pixel size of the alternative text to the 
image is smaller than the image itself. Only if this is the case will the image be replaced 
with the alternative text. There is clearly little point in replacing an image with alternative 
5 text, if that text will take up more space than the image. Following the replacement at step 
5.84 processing proceeds to step 5.12. Similarly, if either of the evaluations of step 5.80 or 
step 5.82 return a negative value i.e. an image does not have alternative text, or the 
alternative text is not smaller than the existing image size, then processing similarly 
proceeds to 5.12. It should be pointed out here that the image transformation depicted in 
10 Figure 5 is applied to each image in turn, before processing proceeds to step 5.12 and the 
counter i is incremented. Moreover, this processing of multiple objects in the web content 
applies to each of the transformations previously described, in that each transformation is 
applied to every relevant object in the web content before the counter i allowing the next 
transformation to be applied is incremented. 
15 At step 5.12, once again the counter i is incremented, such that in this case it now 

takes the value 9. Therefore, when processing proceeds to the evaluation at step 5.2, the 
alternative condition of that evaluation that i is greater than 8 is now met, and hence the 
transformation algorithm must therefore end. 

It will therefore be seen from the above description that the transformation 
20 algorithm acts to apply each transformation to the display objects in the web content in turn, 
and evaluate after the application of each transformation as to whether or not the 
transformed web content is capable of being displayed on the display of the intended 
display device. If this is the case then no further transformations are applied, and the 
function returns TRUE to fits_pageO- 

25 

After the adaptation process is done, ie after the adaptation algorithm of Figure 6 
has ended, a different version of web content will have been generated for a particular client 
capability profile. As there are a plurality of client capability profiles, however, the algorithms 
must be run repeatedly to generate an adapted web content version for each client 

30 capability profile. Following this (i.e. after all the versions have been created) the 
Adaptation module creates the profile Ids for the versions created based on the client 
capability profiles, and stores the adapted versions and Ids in the Content Cache. The 
logical relationship of the profile ids and physical adapted content is in the form of a 
database structure cross reference link. These versions of adapted content are then ready 

35 to be retrieved and delivered to an end user upon request. 
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Thus, in the above described mode of operation, the system according to the 
embodiment of the invention acts to generate multiple adapted versions of the original web 
content in advance of user requests therefor, each version being for a particular intended 
display device with known and specified characteristics. 
5 As mentioned previously, however, the system may also operate in a web server 

capacity. This mode of operation will be described next. 

Imagine that the system is acting as a web server and is connected to the Internet. 
The server receives an http request for web content from an end user device 1. That http 
request is first routed to the client capability discovery module 12, which acts to determine 
10 the set of display characteristics of the end user device 1 , such as, for example screen size, 
colour depth, browser type, network connection, etc. 

The client capability discovery module detects device capabilities based on existing 
standards such as those put forward by M3I (please refer to Current Technologies for 
Device Independent, Mark H Butler, HP Labs Technical Report HPL-2001-83 4 April 2001, 
15 referenced previously, the relevant contents of which necessary for a full understanding of 
the present invention being incorporated herein by reference). At the present time most 
internet browsers contain end-users' device information such as browser type and version, 
IP address, screen resolution etc in the initial request sent to the web server. An end-user's 
device will start communicating with the server when the end-user enters a URL through a 
20 web browser. To get the end-user device information, client capability discovery module 12 
uses a simple Javascript™ to retrieve the client capability information sent from the end- 
user's browser and passes the information to the server through a Java servlet program. 
There follows below a sample of the Javascript™ program which get and post end-user 
device information to the server through a Java® Servlet called "clientprofile": 

25 

<script language=« , JavaScript"> 
function getdeviceinfo () { 

document. formal lent. pageUpdate. value = document .lastModlf ied ; 

document. formclient.availHeight. value = screen. availHeight; 
30 document. formclient.availWidth. value = screen. availWidth; 

document. formclient.buf ferDepth. value = screen. buf ferDepth; 

document. formclient.colorDepth. value = screen. colorDepth; 

document . f ormclient . f on tSmoothingEnabled. value = 
screen. f ontSmoothingEnabled; 
35 document. f ormclient. height. value = screen. height; 

document. f ormclient. width. value = screen. width; 

document. f ormclient. updatelnterval. value = screen . updatelnterval ; 

document. f ormclient. javaEnabled. value = navigator . j avaEnabled ( ) ; 
40 document . f ormclient . appName . value= navigator . appName ; 

document. formclient . appVersion. value = navigator. appVers ion; 
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document. formclient .cookieEnabled. value = navigator. cookieEnabled ; 
document. formclient .cpuClass . value » navigator. cpuCl ass ; 
document. formclient .mimeTypes. value = navigator .mimeTypes ; 
document. formclient . appCodeName . value = navigator . appCodeName ; 
document, formclient. platform, value = navigator, plat form ; 
document. formclient.opsProf ile. value = navigator .opsProf ile ; 
document. formclient.plugins. value = navigator.plugins ? 
document . formclient . sys temLanguage .value =navigator . systemLanguage ; 
document. formclient. userAgent. value «= navigator .userAgent ; 
document. formclient. userLanguage . value = navigator. userLanguage ; 
document. formclient. userProfile. value = navigator .userProf ile ; 
document . formclient . ac t ion= "clientprof ile" ; 
document . formclient . submit ( ) ; 

pt> 

Having determined the set of client characteristics of the end user device 1, the 
client capability discovery module 12 then passes the set of characteristics determined 
thereby to the decision module 14, which acts to compare the end user device 
20 characteristics with the set of client capability profiles stored in the profile server 26. If the 
decision module 14 can match the set of end user device characteristics with one of the 
client capability profiles, then the decision module accesses the content cache 10 which 
stores the different versions of adapted content using the profile ID of the client capability 
profile which matched to the end user device characteristics as an index thereto. The 
25 adapted version of the web content which is indexed to the profile ID of the matching client 
capability profile is then retrieved from the content cache 10, and supplied via the network 
to the end user device 1. Thus, in this mode of operation, the system is able to match end 
user device display characteristics with a set of predefined device characteristics, so as to 
determine the appropriate pregenerated adapted version of the web content to send to the 
30 end user device. 

As previously mentioned above, the system also provides a further mode of 
operation, which combines the operations provided by the previously described modes. 
Here, when an end user device 1 makes a request for web content, as before the client 
capability discovery module 12 acts to determine the display characteristics thereof, which 
35 are then passed to the decision module 14. The decision module 14 then attempts to 
match the capabilities of the end user device 1 with the client capability stored in the profile 
server 26, and if a match is found then the appropriate adapted version of the web content 
is retrieved from the content cache 10, and passed to the end user device 1 over the 
network. If, however, no match can be made, then the decision module 14 acts to operate 
40 the adaptation module 16, by passing the details of the end user device 1 relating to the 
characteristics as determined by the client capability discovery module 12 to the adaptation 



} 

15 </scrij 



31 



module 16 The adaptation module 16 then creates a new client capability profile wh,ch is 
stored in the profile server 26 corresponding to the capabllrties of the end user dev.ce 1, 
and also starts its operation In exactly the same manner as previously described when pre- 
generating adapted versions of the web content, so as to create a new adapted version of 
5 the web content adapted specifically for the end user device 1. That is, the adaptaton 
module 16 causes the content analysis module 20 to operate, which analyses the web 
content, allowing the adaptation module to run the adaptation algorithm so as to generate . 
new adapted version of the web content specifically for the end user device 1 . The adapted 
web content is then fed back to the decision module, which forwards it over the network to 
10 the end user device 1. In addition the new adapted web content is also stored in the content 
cache 10 for future use by similar end user devices to the end user device 1, if requ,red. 
Therefore, in this further mode, new versions of adapted web content can be created 
dynamically in response to a user request but are also then stored so as to be used to 
service future user requests if required. 
15 in addition to the modes of operation described above, the system also provides 

the customisation module 24. This is merely a front end to allow web authors to browse the 
various adapted versions of the web content stored in the content cache, so as to make 
further refinements or improvements thereto if required. In view of this functionality, no 
further discussion of the customisation module 24 will be undertaken. 
20 In conclusion, therefore, the system allows for different versions of web content to 

be created in advance of user requests therefor, such that user requests can then be 
serviced by matching the display characteristics of the end user device to the precreated 
versions, and hence allowing a response to be generated quickly, and with very Irttle 
computing intensity. Additionally, if required, new versions of adapted web content can be 
25 dynamically created to match a specific end user device requesting the web content, and 
the dynamically created adapted web content is then also stored for later use in serving 
future requests from similar end user devices. 

Unless the context clearly requires otherwise, throughout the description and the 
claims the words "comprise", "comprising" and the like are to be construed in an inclusive 
30 as opposed to an exclusive or exhaustive sense; that is to say, in the sense of "includ.ng, 
but not limited to". 

It is understood that in the specific embodiment described herein, particularly with 
reference to the adaptation process of Figures 5 and 6, the procedure involved an iterative 
35 process which worked its way through the analysed web content (the XML tree), splitting 
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1 An apparatus for adapbng web page content for display on an intended d,sp,ay 
Lee, comprising adaptation means for spiKing the content into a plurality of smalier web 
pages for display on said device, the adapation means being arranged ,n use to: 
' (i) s p,K the content into a piurality o, content portions, and to iterativeiy repeat stops 
(ii) to (vi) for at least one of the content portions: 

(ii) analyse the content to determine whether the size of the content port,on ,s 
suitable for display on said device; * 

(iii) if the size of the content portion is not suitable for display on sa.d dev.ee, then 
applyatleastonecontenttransformationtothecontentportion; 

(iv) analyse the transformed content to determine whether the s,ze of the 
transformed content portion is suitable for display on said device; and 

(vi) if the size of the transformed content portion is not suitable for d.sp.ay on sa.d 

device then split the content portion into a plurality of further content portions. 

2 Apparatus according to claim 1, wherein analysis steps (ii) and (iv) to determine 
whether the size is suitable comprise determining whether the content is small enough for 
display on said device. 

3. Apparatus according to claim 1 or 2, wherein the adaptation means is further 

arranged in use to: . 

In the event that step (iv) determines that the transformed content poraon ,s smalt 
enough for display on said display device, then 
S combine the transformed content portion with a further content portion to form a 

combined content portion; 

4. Apparatus according to claim 3. wherein the adaptation means is further arranged in 

analyse the content to determine whether the size of the combined content portion is 
suitable for display on said device, and if the size of the combined content portion is too 
large for said device then apply at least one content t^sformaOon to the comb,ned content 
portion. 
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5 Apparatus according to daim 4 when dependent on claim 3, wherein the adaption 
means comprising a store for content portions, and where* said step of combining two 
content portions comprises selecting the further content portion from the store, the 
adaptation means being further arranged to: 
5 analyse the content to determine whether the size of the transformed combmed 

content portion is suftable for display on said device, and if the size of the . transformed 
combined content portion is too large for said device then break up said combmed content 
portion so as to return the further content portion back Into said store. 

10 6. Apparatus according to claim 5, wherein the adaptation means is further arranged 

if the size of the transformed combined content portion is small enough for display 
on said device then combine it with a second content portion. 

15 7 Apparatus according to any preceding claim, further comprising analysis means 
arranged in use to translate the web page content into a hierarchical tree format compnsrng 
a plurality of nodes labelled so as to represent suitable locations for splitting the content ,nto 
smaller web pages. 

20 8 Apparatus according to any preceding claim, wherein the adaptation means further 
comprises a store for content portions, and wherein said steps of splitting content to form 
smaller content portions comprises adding a plurality of content portions into the store. 

9. Apparatus according to any preceding claim, wherein the adaptation means 

25 00mpn ^ ansfomiatlons ^ for s , oring , record of transformations which have been 
applied to content together with an indication of the type of content those transformafons 
have been applied to. 

30 10. Apparatus according to claim 9 when dependent upon claim 3, wherein the step of 
combining the content portions as defined in claim 3 further comprises: 

applying content transformations according to the record of transformations to the 
further content portion so as to consistently apply transformations to the same type of 
content as indicated in the record of transformations. 

35 
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11 A method for adapting web page content for display on an intended display device 
comprising splitting the content into a plurality of smaller web pages for display on sa,d 
device by performing the following steps: 

(i) splitting the content into a plurality of content portions, and iterative* repeating 
5 steps (ii) to (vi) for at least one of the content portions: 

(ii) analysing the content to determine whether the size of the content portion ,s 
suitable for display on said device; 

(iii) if the size of the content portion is not suitable for display on sa.d dev,ce, then 
applying at least one content transformation to the content portion; 

10 (iv) analysing the transformed content to determine whether the size of the 

transformed content portion is suitable for display on said device; and 

(vi) if the size of the transformed content portion is not suitable for display on sa.d 
device then splitting the content portion into a plurality of further content portions. 

15 12 A method according to claim 11, wherein analysis steps (ii) and (iv) to determine 
whether the size is suitable comprise determining whether the content is smal. enough for 
display on said device. 

13 A method according to claim 11 or 12, further comprising the step of: 
20 in the event that step (iv) determines that the transformed content portion is small 

enough for display on said display device, then 

combining the transformed content portion with a further content portion to form a 

combined content portion; 

25 14 A method according to daim 13, further comprising the steps of: 

analysing the content to determine whether the size of the combined content portion 
is suitable for display on said device, and if the size of the combined content portion >s too 
large for said device then applying at least one content transformation to the oombmed 
content portion. 
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15 A method according to 1 4 when dependent on claim 13, wherein said step of 
combining two content portions comprises selecting the further content portion from a store, 
and further comprising the steps of:: 

analysing the content to determine whether the size of the transformed comb.ned 
35 content portion is suitable for display on said device, and if the size of the transformed 
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combined content portion is too iarge for said device man breaking up said combined 
contant portion and returning the further contant portion baok into sa,d store. 

16 A method according to daim 15, further comprising the step of: 

I me size o, the trensformed combined content portion is smaii enough for d*p,ay 
on said device then combining it with a second content portion. 

17 A method according to any of claims 11 to 16, further comprising the step of: 

0 , nodes iabaiiad so as to represent suitable locattons for splitting the content ,n«o smailer 
web pages. 

18 A method according to any of claims 11 to 17, wherein said steps of splitting content 
to formamauar content porttons comprises adding a plurality o, content porttons ,nto a store. 

19 Amethodaccordlngtoanyofclaims11to18.furthercomprising: 

maintaining a record o, transformed which have been applied to content together 
with an indication of the type of content those transformations have been app.,ed to. 
, 20 Amethodaccordingtoc,aim19whendependentuponciaim13,where,nthes.epof 
combiningthecontentportionaaadermedinclaimSfurthercomprises: 

applying content transformers according to me record of transformers to me 
further Lent portion so as to consistent* apply transformers to me same type of 
content as indicated in me record of transformations. 

21 A computer progrem or suite of programs so arranged auch mat when executed by e 
Impl system it/they cause* me aystam to perform me method of any of c,a,ms 1 1 
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to 20. 



30 22. A modulated earner signal incorporating data corresponding to the computer program or 
at least one of the suite of programs of claim 21 . 

23. A computer readable storage medium storing a computer program or at least one of 
suite of computer programs according to claim 21. 
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Web Content Adaptation Process and System 

Ho<?cribed The adaptation of 

of „ content over a number of ^^^JL^apptyingtransfonn^ns 
procure vWchMegrates.be process of sp « *. ^ ^ Tne 

for example, reducing the font s«, .mages, etc) so ^ 
procedure is carded ou, ey—Hy ov r ^ en^ ^ ^ 

fitting the content into sma,.er an sm ^te Lount of white space visible on the 
^ vari0 us transformations so as to — ^ ^ m formations which 



similar objects. 
[Figure 6] 
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Fig.5 (Cont i). 
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Fig.5 (Cont ii). 
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This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 



Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 
/OTFAJiED TEXT OR DRAWING 



LINES OR MARKS ON ORIGINAL DOCUMENT 

□ REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: 



IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



BEST AVAILABLE IMAGES 




□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 



□ GRAY SCALE DOCUMENTS 




